Dataset statistics
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Number of variables | 9 | 9 |
| Number of observations | 188 | 862 |
| Missing cells | 0 | 0 |
| Missing cells (%) | 0.0% | 0.0% |
| Duplicate rows | 0 | 172 |
| Duplicate rows (%) | 0.0% | 20.0% |
| Total size in memory | 13.3 KiB | 67.3 KiB |
| Average record size in memory | 72.7 B | 80.0 B |
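The duplicate-row figures above can be sanity-checked with a few lines of standard-library Python. This is a minimal sketch under an assumed counting rule (each distinct row that occurs more than once counts as one duplicate); the profiler's exact rule may differ, and the `rows` sample is hypothetical rather than taken from either dataset.

```python
from collections import Counter


def duplicate_summary(rows):
    """Return (n_duplicates, percent) for a list of hashable rows.

    Assumed rule: a distinct row counts once if it appears more than once.
    """
    counts = Counter(rows)
    n_duplicates = sum(1 for c in counts.values() if c > 1)
    percent = 100.0 * n_duplicates / len(rows) if rows else 0.0
    return n_duplicates, percent


# Hypothetical rows, for illustration only.
rows = [(900, 6.9), (300, 6.3), (900, 6.9), (270, 6.3), (300, 6.3)]
n_dup, pct = duplicate_summary(rows)
print(n_dup, pct)

# The reported percentage agrees with the reported counts (172 of 862 rows).
reported_pct = round(100.0 * 172 / 862, 1)
print(reported_pct)
```

Whatever the counting rule, the ratio check at the end only uses the two numbers shown in the table.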
Variable types
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Categorical | 4 | 4 |
| Numeric | 5 | 5 |
| Original Dataset | Oversampled Dataset | Alert type |
|---|---|---|
| time is highly overall correlated with distance | Alert not present in | High Correlation |
| line_width is highly overall correlated with roughness | Alert not present in | High Correlation |
| roughness is highly overall correlated with line_width | Alert not present in | High Correlation |
| distance is highly overall correlated with time | Alert not present in | High Correlation |
| ink_visco_cp is highly overall correlated with surface_tension_dyne_cm and 1 other fields | Alert not present in | High Correlation |
| surface_tension_dyne_cm is highly overall correlated with ink_visco_cp and 1 other fields | Alert not present in | High Correlation |
| ink _density is highly overall correlated with ink_visco_cp and 1 other fields | Alert not present in | High Correlation |
| overspray has 8 (4.3%) zeros | overspray has 18 (2.1%) zeros | Zeros |
| Alert not present in | Dataset has 172 (20.0%) duplicate rows | Duplicates |
| Alert not present in | distance has a high cardinality: 93 distinct values | High Cardinality |
| Alert not present in | ink_visco_cp has a high cardinality: 213 distinct values | High Cardinality |
| Alert not present in | surface_tension_dyne_cm has a high cardinality: 213 distinct values | High Cardinality |
| Alert not present in | ink _density has a high cardinality: 51 distinct values | High Cardinality |
| Alert not present in | distance is highly imbalanced (55.8%) | Imbalance |
| Alert not present in | ink_visco_cp is highly imbalanced (57.3%) | Imbalance |
| Alert not present in | surface_tension_dyne_cm is highly imbalanced (57.3%) | Imbalance |
| Alert not present in | ink _density is highly imbalanced (58.9%) | Imbalance |
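The imbalance percentages in the alerts are consistent with a score of one minus the normalised Shannon entropy of the category counts. The sketch below assumes that definition (it is not confirmed by the report itself) and applies it to the oversampled ink_visco_cp counts listed in this report: 491 for 6.9, 160 for 6.3, and 211 values that occur once.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of category counts.

    0.0 means perfectly balanced; values near 1.0 mean one class dominates.
    Assumed definition, chosen because it matches the reported figure.
    """
    total = sum(counts)
    k = len(counts)
    if k <= 1:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# ink_visco_cp, Oversampled Dataset: 213 distinct values, 211 of them unique.
counts = [491, 160] + [1] * 211
score = imbalance_score(counts)
print(f"{100 * score:.1f}%")
```

Under this definition the score for these counts rounds to the 57.3% shown in the alert; surface_tension_dyne_cm has the same counts and the same reported score.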
Reproduction
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Analysis started | 2023-04-24 01:57:27.668052 | 2023-04-24 01:57:31.897946 |
| Analysis finished | 2023-04-24 01:57:31.882988 | 2023-04-24 01:57:34.604883 |
| Duration | 4.21 seconds | 2.71 seconds |
| Software version | ydata-profiling v4.1.2 | ydata-profiling v4.1.2 |
| Download configuration | config.json | config.json |
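The Duration row follows directly from the start and finish timestamps. A minimal check using only the standard library (the helper name `duration_seconds` is illustrative):

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"


def duration_seconds(started, finished):
    """Elapsed seconds between two timestamps in the report's format."""
    delta = datetime.strptime(finished, FMT) - datetime.strptime(started, FMT)
    return delta.total_seconds()


original = duration_seconds("2023-04-24 01:57:27.668052",
                            "2023-04-24 01:57:31.882988")
oversampled = duration_seconds("2023-04-24 01:57:31.897946",
                               "2023-04-24 01:57:34.604883")
print(round(original, 2), round(oversampled, 2))
```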
distance
Categorical
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Distinct | 3 | 93 |
| Distinct (%) | 1.6% | 10.8% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 1.6 KiB | 13.5 KiB |
Length
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Max length | 3 | 3 |
| Median length | 3 | 3 |
| Mean length | 3 | 3 |
| Min length | 3 | 3 |
Characters and Unicode
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Total characters | 564 | 2586 |
| Distinct characters | 5 | 10 |
| Distinct categories | 1 | 1 |
| Distinct scripts | 1 | 1 |
| Distinct blocks | 1 | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
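As an illustration of these character properties, the sketch below rebuilds the original `distance` column as strings from its value counts (900: 139, 300: 47, 270: 2) and tallies characters and Unicode general categories with the standard `unicodedata` module. Python reports the short category code `Nd`, which corresponds to the "Decimal Number" label used in this report.

```python
import unicodedata
from collections import Counter

# Original `distance` column as strings, rebuilt from its value counts.
values = ["900"] * 139 + ["300"] * 47 + ["270"] * 2

# Count every character and the Unicode general category of every character.
chars = Counter(ch for v in values for ch in v)
categories = Counter(unicodedata.category(ch) for v in values for ch in v)

total_characters = sum(chars.values())
distinct_characters = len(chars)

print(total_characters, distinct_characters)
print(chars.most_common())
print(dict(categories))
```

The totals agree with the Original Dataset column: 564 characters, 5 distinct characters, and a single category.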
Unique
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Unique | 0 | 45 |
| Unique (%) | 0.0% | 5.2% |
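The report distinguishes "Distinct" (how many different values occur) from "Unique" (how many values occur exactly once). A small sketch of that distinction, assuming those two definitions, applied to the original `distance` counts given in this report:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique): different values, and values seen once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Original `distance` column rebuilt from its value counts.
original_distance = ["900"] * 139 + ["300"] * 47 + ["270"] * 2
print(distinct_and_unique(original_distance))
```

For the Original Dataset this gives 3 distinct values and no unique ones, since even the rarest value (270) appears twice.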
Sample
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| 1st row | 270 | 899 |
| 2nd row | 270 | 903 |
| 3rd row | 300 | 904 |
| 4th row | 300 | 900 |
| 5th row | 300 | 888 |
Common Values
Original Dataset
| Value | Count | Frequency (%) |
|---|---|---|
| 900 | 139 | 73.9% |
| 300 | 47 | 25.0% |
| 270 | 2 | 1.1% |
Oversampled Dataset
| Value | Count | Frequency (%) |
|---|---|---|
| 900 | 484 | 56.1% |
| 300 | 174 | 20.2% |
| 911 | 8 | 0.9% |
| 887 | 8 | 0.9% |
| 270 | 6 | 0.7% |
| 902 | 6 | 0.7% |
| 904 | 5 | 0.6% |
| 923 | 5 | 0.6% |
| 890 | 5 | 0.6% |
| 912 | 5 | 0.6% |
| Other values (83) | 156 | 18.1% |
Length
Histogram of lengths of the category
Common Values (Plot)
Original Dataset
Oversampled Dataset
Number of variable categories passes threshold (config.plot.cat_freq.max_unique)
| Value | Count | Frequency (%) |
|---|---|---|
| 900 | 139 | |
| 300 | 47 | 25.0% |
| 270 | 2 | 1.1% |
| Value | Count | Frequency (%) |
| 900 | 484 | |
| 300 | 174 | 20.2% |
| 911 | 8 | 0.9% |
| 887 | 8 | 0.9% |
| 270 | 6 | 0.7% |
| 902 | 6 | 0.7% |
| 904 | 5 | 0.6% |
| 923 | 5 | 0.6% |
| 890 | 5 | 0.6% |
| 912 | 5 | 0.6% |
| Other values (83) | 156 | 18.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 374 | |
| 9 | 139 | 24.6% |
| 3 | 47 | 8.3% |
| 2 | 2 | 0.4% |
| 7 | 2 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 1380 | |
| 9 | 632 | |
| 3 | 220 | 8.5% |
| 8 | 123 | 4.8% |
| 2 | 71 | 2.7% |
| 1 | 57 | 2.2% |
| 7 | 44 | 1.7% |
| 4 | 24 | 0.9% |
| 6 | 18 | 0.7% |
| 5 | 17 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 564 |
| Value | Count | Frequency (%) |
| Decimal Number | 2586 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 374 | |
| 9 | 139 | 24.6% |
| 3 | 47 | 8.3% |
| 2 | 2 | 0.4% |
| 7 | 2 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 1380 | |
| 9 | 632 | |
| 3 | 220 | 8.5% |
| 8 | 123 | 4.8% |
| 2 | 71 | 2.7% |
| 1 | 57 | 2.2% |
| 7 | 44 | 1.7% |
| 4 | 24 | 0.9% |
| 6 | 18 | 0.7% |
| 5 | 17 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 564 |
| Value | Count | Frequency (%) |
| Common | 2586 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 374 | |
| 9 | 139 | 24.6% |
| 3 | 47 | 8.3% |
| 2 | 2 | 0.4% |
| 7 | 2 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 1380 | |
| 9 | 632 | |
| 3 | 220 | 8.5% |
| 8 | 123 | 4.8% |
| 2 | 71 | 2.7% |
| 1 | 57 | 2.2% |
| 7 | 44 | 1.7% |
| 4 | 24 | 0.9% |
| 6 | 18 | 0.7% |
| 5 | 17 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 564 |
| Value | Count | Frequency (%) |
| ASCII | 2586 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 374 | |
| 9 | 139 | 24.6% |
| 3 | 47 | 8.3% |
| 2 | 2 | 0.4% |
| 7 | 2 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 1380 | |
| 9 | 632 | |
| 3 | 220 | 8.5% |
| 8 | 123 | 4.8% |
| 2 | 71 | 2.7% |
| 1 | 57 | 2.2% |
| 7 | 44 | 1.7% |
| 4 | 24 | 0.9% |
| 6 | 18 | 0.7% |
| 5 | 17 | 0.7% |
time
Real number (ℝ)
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Distinct | 63 | 434 |
| Distinct (%) | 33.5% | 50.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 71.138298 | 70.511514 |
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Minimum | 31 | 28.356335 |
| Maximum | 130 | 130 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 1.6 KiB | 13.5 KiB |
Quantile statistics
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Minimum | 31 | 28.356335 |
| 5-th percentile | 34 | 33.627823 |
| Q1 | 45 | 45 |
| median | 69 | 69.433682 |
| Q3 | 89.25 | 88.686351 |
| 95-th percentile | 111.25 | 108 |
| Maximum | 130 | 130 |
| Range | 99 | 101.64366 |
| Interquartile range (IQR) | 44.25 | 43.686351 |
Descriptive statistics
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Standard deviation | 24.68826 | 24.329103 |
| Coefficient of variation (CV) | 0.34704598 | 0.34503731 |
| Kurtosis | -0.82393317 | -0.8470009 |
| Mean | 71.138298 | 70.511514 |
| Median Absolute Deviation (MAD) | 21.5 | 19.554057 |
| Skewness | 0.16926931 | 0.10926534 |
| Sum | 13374 | 60780.925 |
| Variance | 609.51018 | 591.90525 |
| Monotonicity | Not monotonic | Not monotonic |
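Several of the `time` statistics are simple functions of one another, which gives a quick consistency check on the tables. The sketch below uses only numbers printed in this report for the Original Dataset (188 observations); the variable names are illustrative.

```python
import math

# Values copied from the `time` tables for the Original Dataset.
n_observations = 188
total = 13374
std = 24.68826
q1, q3 = 45, 89.25
minimum, maximum = 31, 130

mean = total / n_observations      # reported mean: 71.138298
cv = std / mean                    # reported CV: 0.34704598
variance = std ** 2                # reported variance: 609.51018
iqr = q3 - q1                      # reported IQR: 44.25
value_range = maximum - minimum    # reported range: 99

print(math.isclose(mean, 71.138298, abs_tol=1e-6))
print(math.isclose(cv, 0.34704598, abs_tol=1e-6))
print(math.isclose(variance, 609.51018, abs_tol=1e-3))
print(iqr, value_range)
```

The tolerances reflect the rounding of the printed standard deviation, not any uncertainty in the relationships themselves.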
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 78 | 9 | 4.8% |
| 44 | 9 | 4.8% |
| 66 | 8 | 4.3% |
| 96 | 8 | 4.3% |
| 38 | 8 | 4.3% |
| 63 | 7 | 3.7% |
| 107 | 6 | 3.2% |
| 61 | 6 | 3.2% |
| 83 | 5 | 2.7% |
| 60 | 5 | 2.7% |
| Other values (53) | 117 |
| Value | Count | Frequency (%) |
| 78 | 24 | 2.8% |
| 44 | 21 | 2.4% |
| 38 | 21 | 2.4% |
| 61 | 20 | 2.3% |
| 63 | 19 | 2.2% |
| 34 | 17 | 2.0% |
| 107 | 17 | 2.0% |
| 96 | 16 | 1.9% |
| 66 | 14 | 1.6% |
| 87 | 14 | 1.6% |
| Other values (424) | 679 |
| Value | Count | Frequency (%) |
| 31 | 2 | 1.1% |
| 32 | 4 | |
| 34 | 5 | |
| 35 | 1 | 0.5% |
| 36 | 2 | 1.1% |
| 37 | 2 | 1.1% |
| 38 | 8 | |
| 39 | 1 | 0.5% |
| 40 | 4 | |
| 41 | 2 | 1.1% |
| Value | Count | Frequency (%) |
| 28.35633542 | 1 | 0.1% |
| 30.10913703 | 1 | 0.1% |
| 31 | 5 | |
| 31.14085654 | 1 | 0.1% |
| 31.1857756 | 1 | 0.1% |
| 31.421662 | 1 | 0.1% |
| 31.7004337 | 1 | 0.1% |
| 31.76776959 | 1 | 0.1% |
| 32 | 11 | |
| 32.00413804 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 28.35633542 | 1 | 0.5% |
| 30.10913703 | 1 | 0.5% |
| 31 | 5 | |
| 31.14085654 | 1 | 0.5% |
| 31.1857756 | 1 | 0.5% |
| 31.421662 | 1 | 0.5% |
| 31.7004337 | 1 | 0.5% |
| 31.76776959 | 1 | 0.5% |
| 32 | 11 | |
| 32.00413804 | 1 | 0.5% |
| Value | Count | Frequency (%) |
| 31 | 2 | 0.2% |
| 32 | 4 | |
| 34 | 5 | |
| 35 | 1 | 0.1% |
| 36 | 2 | 0.2% |
| 37 | 2 | 0.2% |
| 38 | 8 | |
| 39 | 1 | 0.1% |
| 40 | 4 | |
| 41 | 2 | 0.2% |
velocity
Real number (ℝ)
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Distinct | 73 | 447 |
| Distinct (%) | 38.8% | 51.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 10.462428 | 10.608104 |
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Minimum | 6.667 | 6.667 |
| Maximum | 15.517241 | 15.517241 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 1.6 KiB | 13.5 KiB |
Quantile statistics
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Minimum | 6.667 | 6.667 |
| 5-th percentile | 6.818 | 6.923 |
| Q1 | 8.27675 | 8.411 |
| median | 9.945 | 10.112 |
| Q3 | 12.9035 | 13.01586 |
| 95-th percentile | 14.9139 | 14.896701 |
| Maximum | 15.517241 | 15.517241 |
| Range | 8.8502414 | 8.8502414 |
| Interquartile range (IQR) | 4.62675 | 4.6048605 |
Descriptive statistics
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Standard deviation | 2.6390637 | 2.5766508 |
| Coefficient of variation (CV) | 0.25224199 | 0.24289457 |
| Kurtosis | -1.181831 | -1.2068823 |
| Mean | 10.462428 | 10.608104 |
| Median Absolute Deviation (MAD) | 2.05 | 2.0923557 |
| Skewness | 0.32023116 | 0.26939049 |
| Sum | 1966.9365 | 9144.1854 |
| Variance | 6.9646573 | 6.6391293 |
| Monotonicity | Not monotonic | Not monotonic |
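The Median Absolute Deviation (MAD) row is a robust spread measure: the median of the absolute deviations from the median. The sketch below uses the unscaled definition; whether the profiler applies a scaling constant is not stated in the report, and the sample values are hypothetical.

```python
from statistics import median


def median_absolute_deviation(values):
    """Unscaled MAD: median of |x - median(values)|."""
    values = list(values)
    centre = median(values)
    return median(abs(x - centre) for x in values)


# Hypothetical sample with one outlier; the MAD barely reacts to it.
sample = [1, 2, 3, 4, 100]
print(median_absolute_deviation(sample))
```

Unlike the standard deviation, the result is hardly affected by the single extreme value in the sample.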
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 9.375 | 12 | 6.4% |
| 6.818 | 9 | 4.8% |
| 11.538 | 9 | 4.8% |
| 13.636 | 6 | 3.2% |
| 7.895 | 6 | 3.2% |
| 14.754 | 6 | 3.2% |
| 10.843 | 5 | 2.7% |
| 15 | 5 | 2.7% |
| 8.411214953 | 5 | 2.7% |
| 10.345 | 5 | 2.7% |
| Other values (63) | 120 |
| Value | Count | Frequency (%) |
| 9.375 | 27 | 3.1% |
| 11.538 | 24 | 2.8% |
| 6.818 | 21 | 2.4% |
| 14.754 | 20 | 2.3% |
| 7.895 | 16 | 1.9% |
| 10.345 | 14 | 1.6% |
| 8.411214953 | 14 | 1.6% |
| 10.843 | 12 | 1.4% |
| 14.28571429 | 12 | 1.4% |
| 8.333333333 | 12 | 1.4% |
| Other values (437) | 690 |
| Value | Count | Frequency (%) |
| 6.667 | 4 | |
| 6.818 | 9 | |
| 6.923 | 2 | 1.1% |
| 6.976744186 | 1 | 0.5% |
| 6.977 | 2 | 1.1% |
| 7.142857143 | 1 | 0.5% |
| 7.143 | 2 | 1.1% |
| 7.317 | 1 | 0.5% |
| 7.317073171 | 1 | 0.5% |
| 7.5 | 4 |
| Value | Count | Frequency (%) |
| 6.667 | 11 | |
| 6.683130991 | 1 | 0.1% |
| 6.73400987 | 1 | 0.1% |
| 6.752611279 | 1 | 0.1% |
| 6.774707324 | 1 | 0.1% |
| 6.818 | 21 | |
| 6.842750206 | 1 | 0.1% |
| 6.85928052 | 1 | 0.1% |
| 6.869708787 | 1 | 0.1% |
| 6.923 | 6 | 0.7% |
| Value | Count | Frequency (%) |
| 6.667 | 11 | |
| 6.683130991 | 1 | 0.5% |
| 6.73400987 | 1 | 0.5% |
| 6.752611279 | 1 | 0.5% |
| 6.774707324 | 1 | 0.5% |
| 6.818 | 21 | |
| 6.842750206 | 1 | 0.5% |
| 6.85928052 | 1 | 0.5% |
| 6.869708787 | 1 | 0.5% |
| 6.923 | 6 | 3.2% |
| Value | Count | Frequency (%) |
| 6.667 | 4 | |
| 6.818 | 9 | |
| 6.923 | 2 | 0.2% |
| 6.976744186 | 1 | 0.1% |
| 6.977 | 2 | 0.2% |
| 7.142857143 | 1 | 0.1% |
| 7.143 | 2 | 0.2% |
| 7.317 | 1 | 0.1% |
| 7.317073171 | 1 | 0.1% |
| 7.5 | 4 |
ink_visco_cp
Categorical
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Distinct | 2 | 213 |
| Distinct (%) | 1.1% | 24.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 1.6 KiB | 13.5 KiB |
Length
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Max length | 3 | 18 |
| Median length | 3 | 3 |
| Mean length | 3 | 6.4095128 |
| Min length | 3 | 3 |
Characters and Unicode
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Total characters | 564 | 5525 |
| Distinct characters | 4 | 11 |
| Distinct categories | 2 | 2 |
| Distinct scripts | 1 | 1 |
| Distinct blocks | 1 | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Unique | 0 | 211 |
| Unique (%) | 0.0% | 24.5% |
Sample
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| 1st row | 6.3 | 6.909597767929222 |
| 2nd row | 6.3 | 6.87655460358194 |
| 3rd row | 6.3 | 6.890437650474096 |
| 4th row | 6.3 | 6.9 |
| 5th row | 6.3 | 6.903606919171564 |
Common Values
Original Dataset
| Value | Count | Frequency (%) |
|---|---|---|
| 6.9 | 140 | 74.5% |
| 6.3 | 48 | 25.5% |
Oversampled Dataset
| Value | Count | Frequency (%) |
|---|---|---|
| 6.9 | 491 | 57.0% |
| 6.3 | 160 | 18.6% |
| 6.909597767929222 | 1 | 0.1% |
| 6.292210172530286 | 1 | 0.1% |
| 6.919260921007547 | 1 | 0.1% |
| 6.903671923663459 | 1 | 0.1% |
| 6.868028799230724 | 1 | 0.1% |
| 6.928740183047567 | 1 | 0.1% |
| 6.885948147225283 | 1 | 0.1% |
| 6.88772432365691 | 1 | 0.1% |
| Other values (203) | 203 | 23.5% |
Length
Histogram of lengths of the category
Common Values (Plot)
Original Dataset
Oversampled Dataset
Number of variable categories passes threshold (config.plot.cat_freq.max_unique)
| Value | Count | Frequency (%) |
|---|---|---|
| 6.9 | 140 | |
| 6.3 | 48 | 25.5% |
| Value | Count | Frequency (%) |
| 6.9 | 491 | |
| 6.3 | 160 | 18.6% |
| 6.883473278255356 | 1 | 0.1% |
| 6.90325049496801 | 1 | 0.1% |
| 6.895028195237112 | 1 | 0.1% |
| 6.880003517882575 | 1 | 0.1% |
| 6.881905441518298 | 1 | 0.1% |
| 6.890437650474096 | 1 | 0.1% |
| 6.903606919171564 | 1 | 0.1% |
| 6.882529066702848 | 1 | 0.1% |
| Other values (203) | 203 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 188 | |
| . | 188 | |
| 9 | 140 | |
| 3 | 48 | 8.5% |
| Value | Count | Frequency (%) |
| 6 | 1165 | |
| . | 862 | |
| 9 | 852 | |
| 3 | 457 | 8.3% |
| 8 | 360 | 6.5% |
| 7 | 333 | 6.0% |
| 2 | 333 | 6.0% |
| 1 | 309 | 5.6% |
| 5 | 302 | 5.5% |
| 0 | 289 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 376 | |
| Other Punctuation | 188 |
| Value | Count | Frequency (%) |
| Decimal Number | 4663 | |
| Other Punctuation | 862 | 15.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 188 | |
| 9 | 140 | |
| 3 | 48 | 12.8% |
| Value | Count | Frequency (%) |
| 6 | 1165 | |
| 9 | 852 | |
| 3 | 457 | 9.8% |
| 8 | 360 | 7.7% |
| 7 | 333 | 7.1% |
| 2 | 333 | 7.1% |
| 1 | 309 | 6.6% |
| 5 | 302 | 6.5% |
| 0 | 289 | 6.2% |
| 4 | 263 | 5.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 188 |
| Value | Count | Frequency (%) |
| . | 862 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 564 |
| Value | Count | Frequency (%) |
| Common | 5525 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 188 | |
| . | 188 | |
| 9 | 140 | |
| 3 | 48 | 8.5% |
| Value | Count | Frequency (%) |
| 6 | 1165 | |
| . | 862 | |
| 9 | 852 | |
| 3 | 457 | 8.3% |
| 8 | 360 | 6.5% |
| 7 | 333 | 6.0% |
| 2 | 333 | 6.0% |
| 1 | 309 | 5.6% |
| 5 | 302 | 5.5% |
| 0 | 289 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 564 |
| Value | Count | Frequency (%) |
| ASCII | 5525 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 188 | |
| . | 188 | |
| 9 | 140 | |
| 3 | 48 | 8.5% |
| Value | Count | Frequency (%) |
| 6 | 1165 | |
| . | 862 | |
| 9 | 852 | |
| 3 | 457 | 8.3% |
| 8 | 360 | 6.5% |
| 7 | 333 | 6.0% |
| 2 | 333 | 6.0% |
| 1 | 309 | 5.6% |
| 5 | 302 | 5.5% |
| 0 | 289 | 5.2% |
surface_tension_dyne_cm
Categorical
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Distinct | 2 | 213 |
| Distinct (%) | 1.1% | 24.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 1.6 KiB | 13.5 KiB |
Length
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Max length | 4 | 18 |
| Median length | 4 | 4 |
| Mean length | 4 | 7.2703016 |
| Min length | 4 | 4 |
Characters and Unicode
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Total characters | 752 | 6267 |
| Distinct characters | 5 | 11 |
| Distinct categories | 2 | 2 |
| Distinct scripts | 1 | 1 |
| Distinct blocks | 1 | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Unique | 0 | 211 |
| Unique (%) | 0.0% | 24.5% |
Sample
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| 1st row | 30.9 | 32.35628987700318 |
| 2nd row | 30.9 | 32.281981516487974 |
| 3rd row | 30.9 | 32.278331835047496 |
| 4th row | 30.9 | 32.3 |
| 5th row | 30.9 | 32.284442353737056 |
Common Values
Original Dataset
| Value | Count | Frequency (%) |
|---|---|---|
| 32.3 | 140 | 74.5% |
| 30.9 | 48 | 25.5% |
Oversampled Dataset
| Value | Count | Frequency (%) |
|---|---|---|
| 32.3 | 491 | 57.0% |
| 30.9 | 160 | 18.6% |
| 32.35628987700318 | 1 | 0.1% |
| 30.938845924874386 | 1 | 0.1% |
| 32.26575526389176 | 1 | 0.1% |
| 32.36777590272072 | 1 | 0.1% |
| 32.27836696945391 | 1 | 0.1% |
| 32.308064267306214 | 1 | 0.1% |
| 32.304756248394845 | 1 | 0.1% |
| 32.28503124123783 | 1 | 0.1% |
| Other values (203) | 203 | 23.5% |
Length
Histogram of lengths of the category
Common Values (Plot)
Original Dataset
Oversampled Dataset
Number of variable categories passes threshold (config.plot.cat_freq.max_unique)
| Value | Count | Frequency (%) |
|---|---|---|
| 32.3 | 140 | |
| 30.9 | 48 | 25.5% |
| Value | Count | Frequency (%) |
| 32.3 | 491 | |
| 30.9 | 160 | 18.6% |
| 32.36274053364617 | 1 | 0.1% |
| 32.26934184247805 | 1 | 0.1% |
| 32.33727500945055 | 1 | 0.1% |
| 32.33752823600665 | 1 | 0.1% |
| 32.361704761946086 | 1 | 0.1% |
| 32.278331835047496 | 1 | 0.1% |
| 32.284442353737056 | 1 | 0.1% |
| 32.24348408032512 | 1 | 0.1% |
| Other values (203) | 203 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 328 | |
| . | 188 | |
| 2 | 140 | |
| 0 | 48 | 6.4% |
| 9 | 48 | 6.4% |
| Value | Count | Frequency (%) |
| 3 | 1686 | |
| 2 | 953 | |
| . | 862 | |
| 0 | 476 | 7.6% |
| 9 | 465 | 7.4% |
| 6 | 330 | 5.3% |
| 8 | 310 | 4.9% |
| 1 | 307 | 4.9% |
| 5 | 299 | 4.8% |
| 7 | 292 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 564 | |
| Other Punctuation | 188 | 25.0% |
| Value | Count | Frequency (%) |
| Decimal Number | 5405 | |
| Other Punctuation | 862 | 13.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 328 | |
| 2 | 140 | |
| 0 | 48 | 8.5% |
| 9 | 48 | 8.5% |
| Value | Count | Frequency (%) |
| 3 | 1686 | |
| 2 | 953 | |
| 0 | 476 | 8.8% |
| 9 | 465 | 8.6% |
| 6 | 330 | 6.1% |
| 8 | 310 | 5.7% |
| 1 | 307 | 5.7% |
| 5 | 299 | 5.5% |
| 7 | 292 | 5.4% |
| 4 | 287 | 5.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 188 |
| Value | Count | Frequency (%) |
| . | 862 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 752 |
| Value | Count | Frequency (%) |
| Common | 6267 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 328 | |
| . | 188 | |
| 2 | 140 | |
| 0 | 48 | 6.4% |
| 9 | 48 | 6.4% |
| Value | Count | Frequency (%) |
| 3 | 1686 | |
| 2 | 953 | |
| . | 862 | |
| 0 | 476 | 7.6% |
| 9 | 465 | 7.4% |
| 6 | 330 | 5.3% |
| 8 | 310 | 4.9% |
| 1 | 307 | 4.9% |
| 5 | 299 | 4.8% |
| 7 | 292 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 752 |
| Value | Count | Frequency (%) |
| ASCII | 6267 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 328 | |
| . | 188 | |
| 2 | 140 | |
| 0 | 48 | 6.4% |
| 9 | 48 | 6.4% |
| Value | Count | Frequency (%) |
| 3 | 1686 | |
| 2 | 953 | |
| . | 862 | |
| 0 | 476 | 7.6% |
| 9 | 465 | 7.4% |
| 6 | 330 | 5.3% |
| 8 | 310 | 4.9% |
| 1 | 307 | 4.9% |
| 5 | 299 | 4.8% |
| 7 | 292 | 4.7% |
ink _density
Categorical
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Distinct | 2 | 51 |
| Distinct (%) | 1.1% | 5.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 1.6 KiB | 13.5 KiB |
Length
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Max length | 4 | 4 |
| Median length | 4 | 4 |
| Mean length | 4 | 4 |
| Min length | 4 | 4 |
Characters and Unicode
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Total characters | 752 | 3448 |
| Distinct characters | 5 | 10 |
| Distinct categories | 1 | 1 |
| Distinct scripts | 1 | 1 |
| Distinct blocks | 1 | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Unique | 0 | 25 |
| Unique (%) | 0.0% | 2.9% |
Sample
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| 1st row | 1517 | 1614 |
| 2nd row | 1517 | 1612 |
| 3rd row | 1517 | 1610 |
| 4th row | 1517 | 1614 |
| 5th row | 1517 | 1612 |
Common Values
Original Dataset
| Value | Count | Frequency (%) |
|---|---|---|
| 1614 | 140 | 74.5% |
| 1517 | 48 | 25.5% |
Oversampled Dataset
| Value | Count | Frequency (%) |
|---|---|---|
| 1614 | 519 | 60.2% |
| 1517 | 173 | 20.1% |
| 1613 | 21 | 2.4% |
| 1615 | 15 | 1.7% |
| 1612 | 13 | 1.5% |
| 1616 | 13 | 1.5% |
| 1611 | 9 | 1.0% |
| 1609 | 7 | 0.8% |
| 1515 | 6 | 0.7% |
| 1610 | 6 | 0.7% |
| Other values (41) | 80 | 9.3% |
Length
Histogram of lengths of the category
Common Values (Plot)
Original Dataset
Oversampled Dataset
Number of variable categories passes threshold (config.plot.cat_freq.max_unique)
| Value | Count | Frequency (%) |
|---|---|---|
| 1614 | 140 | |
| 1517 | 48 | 25.5% |
| Value | Count | Frequency (%) |
| 1614 | 519 | |
| 1517 | 173 | 20.1% |
| 1613 | 21 | 2.4% |
| 1615 | 15 | 1.7% |
| 1612 | 13 | 1.5% |
| 1616 | 13 | 1.5% |
| 1611 | 9 | 1.0% |
| 1609 | 7 | 0.8% |
| 1518 | 6 | 0.7% |
| 1617 | 6 | 0.7% |
| Other values (41) | 80 | 9.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 376 | |
| 6 | 140 | 18.6% |
| 4 | 140 | 18.6% |
| 5 | 48 | 6.4% |
| 7 | 48 | 6.4% |
| Value | Count | Frequency (%) |
| 1 | 1690 | |
| 6 | 651 | 18.9% |
| 4 | 525 | 15.2% |
| 5 | 266 | 7.7% |
| 7 | 181 | 5.2% |
| 2 | 34 | 1.0% |
| 3 | 32 | 0.9% |
| 0 | 29 | 0.8% |
| 9 | 20 | 0.6% |
| 8 | 20 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 752 |
| Value | Count | Frequency (%) |
| Decimal Number | 3448 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 376 | |
| 6 | 140 | 18.6% |
| 4 | 140 | 18.6% |
| 5 | 48 | 6.4% |
| 7 | 48 | 6.4% |
| Value | Count | Frequency (%) |
| 1 | 1690 | |
| 6 | 651 | 18.9% |
| 4 | 525 | 15.2% |
| 5 | 266 | 7.7% |
| 7 | 181 | 5.2% |
| 2 | 34 | 1.0% |
| 3 | 32 | 0.9% |
| 0 | 29 | 0.8% |
| 9 | 20 | 0.6% |
| 8 | 20 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 752 |
| Value | Count | Frequency (%) |
| Common | 3448 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 376 | |
| 6 | 140 | 18.6% |
| 4 | 140 | 18.6% |
| 5 | 48 | 6.4% |
| 7 | 48 | 6.4% |
| Value | Count | Frequency (%) |
| 1 | 1690 | |
| 6 | 651 | 18.9% |
| 4 | 525 | 15.2% |
| 5 | 266 | 7.7% |
| 7 | 181 | 5.2% |
| 2 | 34 | 1.0% |
| 3 | 32 | 0.9% |
| 0 | 29 | 0.8% |
| 9 | 20 | 0.6% |
| 8 | 20 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 752 |
| Value | Count | Frequency (%) |
| ASCII | 3448 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 376 | |
| 6 | 140 | 18.6% |
| 4 | 140 | 18.6% |
| 5 | 48 | 6.4% |
| 7 | 48 | 6.4% |
| Value | Count | Frequency (%) |
| 1 | 1690 | |
| 6 | 651 | 18.9% |
| 4 | 525 | 15.2% |
| 5 | 266 | 7.7% |
| 7 | 181 | 5.2% |
| 2 | 34 | 1.0% |
| 3 | 32 | 0.9% |
| 0 | 29 | 0.8% |
| 9 | 20 | 0.6% |
| 8 | 20 | 0.6% |
line_width
Real number (ℝ)
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Distinct | 100 | 162 |
| Distinct (%) | 53.2% | 18.8% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 229.3617 | 252.35847 |
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Minimum | 112 | 112 |
| Maximum | 391 | 457 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 1.6 KiB | 13.5 KiB |
Quantile statistics
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Minimum | 112 | 112 |
| 5-th percentile | 179 | 183 |
| Q1 | 194 | 208 |
| median | 222.5 | 253 |
| Q3 | 260 | 294 |
| 95-th percentile | 305.65 | 322.95 |
| Maximum | 391 | 457 |
| Range | 279 | 345 |
| Interquartile range (IQR) | 66 | 86 |
Descriptive statistics
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Standard deviation | 43.833631 | 51.6546 |
| Coefficient of variation (CV) | 0.19111138 | 0.20468741 |
| Kurtosis | 0.33187771 | 0.25096912 |
| Mean | 229.3617 | 252.35847 |
| Median Absolute Deviation (MAD) | 31.5 | 43 |
| Skewness | 0.57550192 | 0.38003583 |
| Sum | 43120 | 217533 |
| Variance | 1921.3872 | 2668.1977 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 191 | 5 | 2.7% |
| 194 | 4 | 2.1% |
| 183 | 4 | 2.1% |
| 193 | 4 | 2.1% |
| 224 | 4 | 2.1% |
| 203 | 4 | 2.1% |
| 232 | 4 | 2.1% |
| 204 | 4 | 2.1% |
| 207 | 4 | 2.1% |
| 185 | 4 | 2.1% |
| Other values (90) | 147 |
| Value | Count | Frequency (%) |
| 303 | 17 | 2.0% |
| 321 | 14 | 1.6% |
| 305 | 14 | 1.6% |
| 306 | 13 | 1.5% |
| 218 | 13 | 1.5% |
| 194 | 12 | 1.4% |
| 225 | 12 | 1.4% |
| 191 | 12 | 1.4% |
| 224 | 12 | 1.4% |
| 203 | 12 | 1.4% |
| Other values (152) | 731 |
| Value | Count | Frequency (%) |
| 112 | 1 | 0.5% |
| 123 | 1 | 0.5% |
| 142 | 1 | 0.5% |
| 163 | 1 | 0.5% |
| 167 | 1 | 0.5% |
| 176 | 1 | 0.5% |
| 177 | 1 | 0.5% |
| 178 | 1 | 0.5% |
| 179 | 3 | |
| 180 | 2 |
| Value | Count | Frequency (%) |
| 112 | 3 | 0.3% |
| 123 | 3 | 0.3% |
| 142 | 1 | 0.1% |
| 163 | 3 | 0.3% |
| 167 | 2 | 0.2% |
| 176 | 3 | 0.3% |
| 177 | 2 | 0.2% |
| 178 | 3 | 0.3% |
| 179 | 8 | |
| 180 | 5 |
| Value | Count | Frequency (%) |
| 112 | 3 | 1.6% |
| 123 | 3 | 1.6% |
| 142 | 1 | 0.5% |
| 163 | 3 | 1.6% |
| 167 | 2 | 1.1% |
| 176 | 3 | 1.6% |
| 177 | 2 | 1.1% |
| 178 | 3 | 1.6% |
| 179 | 8 | |
| 180 | 5 |
| Value | Count | Frequency (%) |
| 112 | 1 | 0.1% |
| 123 | 1 | 0.1% |
| 142 | 1 | 0.1% |
| 163 | 1 | 0.1% |
| 167 | 1 | 0.1% |
| 176 | 1 | 0.1% |
| 177 | 1 | 0.1% |
| 178 | 1 | 0.1% |
| 179 | 3 | |
| 180 | 2 |
overspray
Real number (ℝ)
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Distinct | 119 | 265 |
| Distinct (%) | 63.3% | 30.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 104.83511 | 139.60441 |
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 415 | 423 |
| Zeros | 8 | 18 |
| Zeros (%) | 4.3% | 2.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 1.6 KiB | 13.5 KiB |
Quantile statistics
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 1 | 3 |
| Q1 | 16 | 33 |
| median | 59 | 99 |
| Q3 | 169 | 241 |
| 95-th percentile | 341.6 | 372 |
| Maximum | 415 | 423 |
| Range | 415 | 423 |
| Interquartile range (IQR) | 153 | 208 |
Descriptive statistics
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Standard deviation | 110.08344 | 122.40498 |
| Coefficient of variation (CV) | 1.0500627 | 0.87679878 |
| Kurtosis | 0.28180928 | -0.86703726 |
| Mean | 104.83511 | 139.60441 |
| Median Absolute Deviation (MAD) | 49 | 87 |
| Skewness | 1.1385965 | 0.63833792 |
| Sum | 19709 | 120339 |
| Variance | 12118.363 | 14982.978 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 8 | 4.3% |
| 10 | 5 | 2.7% |
| 91 | 5 | 2.7% |
| 3 | 5 | 2.7% |
| 7 | 4 | 2.1% |
| 47 | 4 | 2.1% |
| 32 | 4 | 2.1% |
| 220 | 4 | 2.1% |
| 24 | 3 | 1.6% |
| 5 | 3 | 1.6% |
| Other values (109) | 143 |
| Value | Count | Frequency (%) |
| 0 | 18 | 2.1% |
| 91 | 15 | 1.7% |
| 7 | 14 | 1.6% |
| 10 | 12 | 1.4% |
| 201 | 11 | 1.3% |
| 11 | 10 | 1.2% |
| 3 | 10 | 1.2% |
| 34 | 10 | 1.2% |
| 32 | 10 | 1.2% |
| 220 | 10 | 1.2% |
| Other values (255) | 742 |
| Value | Count | Frequency (%) |
| 0 | 8 | |
| 1 | 3 | 1.6% |
| 2 | 3 | 1.6% |
| 3 | 5 | |
| 4 | 1 | 0.5% |
| 5 | 3 | 1.6% |
| 6 | 1 | 0.5% |
| 7 | 4 | |
| 8 | 2 | 1.1% |
| 9 | 1 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 18 | |
| 1 | 9 | |
| 2 | 9 | |
| 3 | 10 | |
| 4 | 4 | 0.5% |
| 5 | 9 | |
| 6 | 4 | 0.5% |
| 7 | 14 | |
| 8 | 6 | 0.7% |
| 9 | 4 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 18 | |
| 1 | 9 | |
| 2 | 9 | |
| 3 | 10 | |
| 4 | 4 | 2.1% |
| 5 | 9 | |
| 6 | 4 | 2.1% |
| 7 | 14 | |
| 8 | 6 | 3.2% |
| 9 | 4 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 8 | |
| 1 | 3 | 0.3% |
| 2 | 3 | 0.3% |
| 3 | 5 | |
| 4 | 1 | 0.1% |
| 5 | 3 | 0.3% |
| 6 | 1 | 0.1% |
| 7 | 4 | |
| 8 | 2 | 0.2% |
| 9 | 1 | 0.1% |
roughness
Real number (ℝ)
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Distinct | 94 | 131 |
| Distinct (%) | 50.0% | 15.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 98.037234 | 112.28422 |
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Minimum | 43 | 43 |
| Maximum | 192 | 228 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 1.6 KiB | 13.5 KiB |
Quantile statistics
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Minimum | 43 | 43 |
| 5-th percentile | 58.35 | 63.05 |
| Q1 | 75 | 82 |
| median | 91 | 112 |
| Q3 | 117.25 | 142 |
| 95-th percentile | 152.65 | 164 |
| Maximum | 192 | 228 |
| Range | 149 | 185 |
| Interquartile range (IQR) | 42.25 | 60 |
Descriptive statistics
| | Original Dataset | Oversampled Dataset |
|---|---|---|
| Standard deviation | 30.766043 | 34.582315 |
| Coefficient of variation (CV) | 0.31381998 | 0.30798909 |
| Kurtosis | 0.11304357 | -0.8400436 |
| Mean | 98.037234 | 112.28422 |
| Median Absolute Deviation (MAD) | 19 | 30 |
| Skewness | 0.73342434 | 0.12559766 |
| Sum | 18431 | 96789 |
| Variance | 946.54941 | 1195.9365 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 77 | 8 | 4.3% |
| 68 | 8 | 4.3% |
| 73 | 6 | 3.2% |
| 99 | 6 | 3.2% |
| 84 | 6 | 3.2% |
| 75 | 5 | 2.7% |
| 85 | 5 | 2.7% |
| 108 | 5 | 2.7% |
| 72 | 5 | 2.7% |
| 117 | 4 | 2.1% |
| Other values (84) | 130 |
| Value | Count | Frequency (%) |
| 77 | 25 | 2.9% |
| 68 | 21 | 2.4% |
| 145 | 18 | 2.1% |
| 118 | 17 | 2.0% |
| 99 | 17 | 2.0% |
| 108 | 16 | 1.9% |
| 84 | 16 | 1.9% |
| 71 | 15 | 1.7% |
| 73 | 15 | 1.7% |
| 85 | 15 | 1.7% |
| Other values (121) | 687 | 79.7% |
| Value | Count | Frequency (%) |
| 43 | 1 | |
| 44 | 1 | |
| 45 | 1 | |
| 48 | 2 | |
| 49 | 2 | |
| 54 | 1 | |
| 57 | 1 | |
| 58 | 1 | |
| 59 | 1 | |
| 60 | 1 | |
| Value | Count | Frequency (%) |
| 43 | 3 | |
| 44 | 3 | |
| 45 | 4 | |
| 46 | 2 | 0.2% |
| 48 | 3 | |
| 49 | 5 | |
| 50 | 1 | 0.1% |
| 51 | 2 | 0.2% |
| 54 | 4 | |
| 56 | 1 | 0.1% |
| | time | velocity | line_width | overspray | roughness | distance | ink_visco_cp | surface_tension_dyne_cm | ink _density |
|---|---|---|---|---|---|---|---|---|---|
| time | 1.000 | 0.023 | -0.042 | -0.067 | -0.122 | 0.687 | 0.275 | 0.275 | 0.275 |
| velocity | 0.023 | 1.000 | 0.300 | 0.062 | 0.136 | 0.482 | 0.278 | 0.278 | 0.278 |
| line_width | -0.042 | 0.300 | 1.000 | 0.290 | 0.619 | 0.000 | 0.000 | 0.000 | 0.000 |
| overspray | -0.067 | 0.062 | 0.290 | 1.000 | 0.229 | 0.000 | 0.198 | 0.198 | 0.198 |
| roughness | -0.122 | 0.136 | 0.619 | 0.229 | 1.000 | 0.202 | 0.271 | 0.271 | 0.271 |
| distance | 0.687 | 0.482 | 0.000 | 0.000 | 0.202 | 1.000 | 0.151 | 0.151 | 0.151 |
| ink_visco_cp | 0.275 | 0.278 | 0.000 | 0.198 | 0.271 | 0.151 | 1.000 | 0.986 | 0.986 |
| surface_tension_dyne_cm | 0.275 | 0.278 | 0.000 | 0.198 | 0.271 | 0.151 | 0.986 | 1.000 | 0.986 |
| ink _density | 0.275 | 0.278 | 0.000 | 0.198 | 0.271 | 0.151 | 0.986 | 0.986 | 1.000 |
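To read the matrix above programmatically, one option is to scan the upper triangle and keep the pairs whose absolute coefficient reaches a cutoff. The sketch below does this in plain Python. The function name `high_correlation_pairs` and the 0.5 threshold are assumptions chosen for illustration; the cutoff actually used by the report generator is not stated in this section.

```python
from itertools import combinations

columns = [
    "time", "velocity", "line_width", "overspray", "roughness",
    "distance", "ink_visco_cp", "surface_tension_dyne_cm", "ink _density",
]

# Rows copied from the correlation matrix above, in the same column order.
matrix = [
    [1.000, 0.023, -0.042, -0.067, -0.122, 0.687, 0.275, 0.275, 0.275],
    [0.023, 1.000, 0.300, 0.062, 0.136, 0.482, 0.278, 0.278, 0.278],
    [-0.042, 0.300, 1.000, 0.290, 0.619, 0.000, 0.000, 0.000, 0.000],
    [-0.067, 0.062, 0.290, 1.000, 0.229, 0.000, 0.198, 0.198, 0.198],
    [-0.122, 0.136, 0.619, 0.229, 1.000, 0.202, 0.271, 0.271, 0.271],
    [0.687, 0.482, 0.000, 0.000, 0.202, 1.000, 0.151, 0.151, 0.151],
    [0.275, 0.278, 0.000, 0.198, 0.271, 0.151, 1.000, 0.986, 0.986],
    [0.275, 0.278, 0.000, 0.198, 0.271, 0.151, 0.986, 1.000, 0.986],
    [0.275, 0.278, 0.000, 0.198, 0.271, 0.151, 0.986, 0.986, 1.000],
]


def high_correlation_pairs(columns, matrix, threshold=0.5):
    """List (col_a, col_b, value) for each pair with |value| >= threshold."""
    pairs = []
    # combinations() visits each unordered pair once, skipping the diagonal.
    for i, j in combinations(range(len(columns)), 2):
        value = matrix[i][j]
        if abs(value) >= threshold:
            pairs.append((columns[i], columns[j], value))
    # Strongest relationships first.
    return sorted(pairs, key=lambda pair: -abs(pair[2]))


for col_a, col_b, value in high_correlation_pairs(columns, matrix):
    print(f"{col_a} <-> {col_b}: {value:.3f}")
```

With the assumed 0.5 cutoff, the flagged pairs are time and distance, line_width and roughness, and the three ink properties with each other; velocity and distance (0.482) fall just below it.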
Original Dataset
A simple visualization of nullity by column.
Oversampled Dataset
A simple visualization of nullity by column.
Original Dataset
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Oversampled Dataset
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Original Dataset
| | distance | time | velocity | ink_visco_cp | surface_tension_dyne_cm | ink _density | line_width | overspray | roughness |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 270 | 34.0 | 7.941 | 6.3 | 30.9 | 1517 | 294 | 12 | 164 |
| 1 | 270 | 34.0 | 7.941 | 6.3 | 30.9 | 1517 | 261 | 136 | 141 |
| 2 | 300 | 38.0 | 7.895 | 6.3 | 30.9 | 1517 | 218 | 11 | 103 |
| 3 | 300 | 44.0 | 6.818 | 6.3 | 30.9 | 1517 | 190 | 15 | 68 |
| 4 | 300 | 41.0 | 7.317 | 6.3 | 30.9 | 1517 | 190 | 91 | 90 |
| 5 | 300 | 40.0 | 7.500 | 6.9 | 32.3 | 1614 | 180 | 0 | 62 |
| 6 | 300 | 38.0 | 7.895 | 6.9 | 32.3 | 1614 | 178 | 80 | 82 |
| 7 | 300 | 43.0 | 6.977 | 6.3 | 30.9 | 1517 | 185 | 24 | 145 |
| 8 | 300 | 43.0 | 6.977 | 6.3 | 30.9 | 1517 | 213 | 50 | 161 |
| 9 | 300 | 34.0 | 8.824 | 6.3 | 30.9 | 1517 | 323 | 8 | 171 |
Oversampled Dataset
| | distance | time | velocity | ink_visco_cp | surface_tension_dyne_cm | ink _density | line_width | overspray | roughness |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 899 | 62.214499 | 14.042099 | 6.909598 | 32.356290 | 1614 | 285 | 103 | 118 |
| 1 | 903 | 61.873446 | 14.571294 | 6.876555 | 32.281982 | 1612 | 284 | 105 | 119 |
| 2 | 904 | 64.223973 | 14.172891 | 6.890438 | 32.278332 | 1610 | 286 | 111 | 116 |
| 3 | 900 | 70.411670 | 13.147105 | 6.900000 | 32.300000 | 1614 | 287 | 94 | 118 |
| 4 | 888 | 61.526920 | 14.358877 | 6.903607 | 32.284442 | 1612 | 285 | 114 | 117 |
| 5 | 900 | 86.481936 | 10.717031 | 6.900000 | 32.300000 | 1614 | 287 | 106 | 93 |
| 6 | 923 | 93.496942 | 9.655995 | 6.882529 | 32.243484 | 1613 | 291 | 105 | 88 |
| 7 | 914 | 94.996984 | 9.750736 | 6.886091 | 32.261477 | 1619 | 288 | 107 | 84 |
| 8 | 904 | 93.024731 | 9.518223 | 6.908064 | 32.288811 | 1608 | 287 | 109 | 87 |
| 9 | 900 | 92.731914 | 9.767186 | 6.900000 | 32.300000 | 1614 | 287 | 106 | 86 |
Original Dataset
| | distance | time | velocity | ink_visco_cp | surface_tension_dyne_cm | ink _density | line_width | overspray | roughness |
|---|---|---|---|---|---|---|---|---|---|
| 178 | 900 | 108.0 | 8.333333 | 6.9 | 32.3 | 1614 | 212 | 282 | 72 |
| 179 | 900 | 93.0 | 9.677419 | 6.9 | 32.3 | 1614 | 323 | 47 | 157 |
| 180 | 900 | 93.0 | 9.677419 | 6.9 | 32.3 | 1614 | 305 | 201 | 108 |
| 181 | 900 | 94.0 | 9.574468 | 6.9 | 32.3 | 1614 | 288 | 107 | 85 |
| 182 | 900 | 95.0 | 9.473684 | 6.9 | 32.3 | 1614 | 290 | 24 | 115 |
| 183 | 900 | 96.0 | 9.375000 | 6.9 | 32.3 | 1614 | 262 | 17 | 94 |
| 184 | 900 | 96.0 | 9.375000 | 6.9 | 32.3 | 1614 | 241 | 15 | 86 |
| 185 | 900 | 96.0 | 9.375000 | 6.9 | 32.3 | 1614 | 191 | 77 | 87 |
| 186 | 900 | 108.0 | 8.333333 | 6.9 | 32.3 | 1614 | 188 | 1 | 73 |
| 187 | 900 | 107.0 | 8.411215 | 6.9 | 32.3 | 1614 | 203 | 5 | 45 |
Oversampled Dataset
| | distance | time | velocity | ink_visco_cp | surface_tension_dyne_cm | ink _density | line_width | overspray | roughness |
|---|---|---|---|---|---|---|---|---|---|
| 175 | 900 | 107.0 | 8.411215 | 6.9 | 32.3 | 1614 | 194 | 72 | 76 |
| 176 | 900 | 107.0 | 8.411215 | 6.9 | 32.3 | 1614 | 204 | 5 | 73 |
| 177 | 900 | 108.0 | 8.333333 | 6.9 | 32.3 | 1614 | 191 | 81 | 99 |
| 178 | 900 | 108.0 | 8.333333 | 6.9 | 32.3 | 1614 | 212 | 282 | 72 |
| 179 | 900 | 93.0 | 9.677419 | 6.9 | 32.3 | 1614 | 323 | 47 | 157 |
| 180 | 900 | 93.0 | 9.677419 | 6.9 | 32.3 | 1614 | 305 | 201 | 108 |
| 181 | 900 | 94.0 | 9.574468 | 6.9 | 32.3 | 1614 | 288 | 107 | 85 |
| 185 | 900 | 96.0 | 9.375000 | 6.9 | 32.3 | 1614 | 191 | 77 | 87 |
| 186 | 900 | 108.0 | 8.333333 | 6.9 | 32.3 | 1614 | 188 | 1 | 73 |
| 187 | 900 | 107.0 | 8.411215 | 6.9 | 32.3 | 1614 | 203 | 5 | 45 |
Original Dataset
Dataset does not contain duplicate rows.
Oversampled Dataset
| | distance | time | velocity | ink_visco_cp | surface_tension_dyne_cm | ink _density | line_width | overspray | roughness | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 270 | 34.0 | 7.941 | 6.3 | 30.9 | 1517 | 261 | 136 | 141 | 3 |
| 1 | 270 | 34.0 | 7.941 | 6.3 | 30.9 | 1517 | 294 | 12 | 164 | 3 |
| 2 | 300 | 31.0 | 9.677 | 6.3 | 30.9 | 1517 | 218 | 187 | 117 | 3 |
| 4 | 300 | 32.0 | 9.375 | 6.3 | 30.9 | 1517 | 306 | 102 | 145 | 3 |
| 5 | 300 | 32.0 | 9.375 | 6.9 | 32.3 | 1614 | 183 | 107 | 84 | 3 |
| 7 | 300 | 32.0 | 9.375 | 6.9 | 32.3 | 1614 | 254 | 268 | 131 | 3 |
| 8 | 300 | 34.0 | 8.824 | 6.3 | 30.9 | 1517 | 226 | 27 | 95 | 3 |
| 9 | 300 | 34.0 | 8.824 | 6.3 | 30.9 | 1517 | 323 | 8 | 171 | 3 |
| 10 | 300 | 34.0 | 8.824 | 6.9 | 32.3 | 1614 | 219 | 213 | 185 | 3 |
| 11 | 300 | 36.0 | 8.333 | 6.9 | 32.3 | 1614 | 193 | 41 | 71 | 3 |